Deep C (and C++) 

by Olve Maudal and Jon Jagger 




Programming is hard. Programming correct C and C++ is particularly hard. Indeed, both in C 
and certainly in C++, it is uncommon to see a screenful containing only well defined and 
conforming code. Why do professional programmers write code like this? Because most 
programmers do not have a deep understanding of the language they are using. While they 
sometimes know that certain things are undefined or unspecified, they often do not know why 
it is so. In these slides we will study small code snippets in C and C++, and use them to discuss 
the fundamental building blocks, limitations and underlying design philosophies of these 
wonderful but dangerous programming languages. 
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Suppose you are about to interview a candidate for a position as 
C programmer for various embedded platforms. As part of the 
interview you might want to check whether the candidate has a 
deep understanding of the programming language or not... here 
is a great code snippet to get the conversation started: 
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What will happen if you try to compile, link and run this program? 
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int a = 42; 
printf ("%d\n" , a); 
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But another candidate might use this as an opportunity to start 
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You probably want to #include <Stdio.h> 
which has an explicit declaration of printf ( ). The 
program will compile, link and run, and it will write the 
number 42 followed by a newline to the standard 
output stream. 
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However a proper C compiler will create an implicit 
declaration for the function printf ( ), compile this 

code into an object file. 
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If this is C99, the exit value is defined to indicate 
success to the runtime environment, just like in 
C++98, but for older versions of C, like ANSI C 
and K&R, the exit value from this program will 
be some undefined garbage value. 



But since return values are often passed in a 
register I would not be surprised if the garbage 
value happens to be 3... since printf ( ) will 
return 3, the number of characters written to 
standard out. 
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And talking about C standards... if you want to show 
that you care about C programming, you should use 
int ma in (void) as your entry point - since the 
standard says so. 

Using void to indicate no parameters is essential for 
declarations in C, eg a declaration 'int f ( ) ; \ says 
there is a function f that takes any number of 
arguments. While you probably meant to say 
'int f ( void ) ; \ Being explicit by using void also 
for function definitions does not hurt. 
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But the code is actually undefined. 
V 



#include <stdio.h> 



void foo(void) 
{ 

int a = 41; 
a = a++; 

printf ("%d\n" , a); 

} 

int main (void) 
{ 

foo() ; 

} 



#include <stdio.h> 



void foo(void) 
{ 

int a = 41; 
a = a++; 

printf ("%d\n" , a); 

} 

int main (void) 
{ 

foo() ; 

} 



#include <stdio.h> 



void foo(void) 
{ 

int a = 41; 
a = a++; 

printf ("%d\n" , a); 

} 

int main (void) 
{ 

foo() ; 

} 



a gets an undefined value 




#include <stdio.h> 



void foo(void) 
{ 

int a = 41; 
a = a++; 

printf ("%d\n" , a); 

} 

int main (void) 
{ 

foo() ; 

} 



a gets an undefined value J 

I don't get a warning when compiling it, 

and I do get 42 




#include <stdio.h> 

void foo(void) 
{ 

int a = 41; 
a = a++; 

printf ("%d\n" , a); 

} 

int main(void) 
{ 

foo() ; 

} 



a gets an undefined value 



I don't get a warning when compiling it, 

and I do get 42 




Then you must increase the warning level, 
the value of a is certainly undefined after 
the assignment and increment because you 
violate one of the fundamental rules in C 
(and C++). The rules for sequencing says 
that you can only update a variable once 
between sequence points. Here you try to 
update it two times, and this causes a to 
become undefined. 




#include <stdio.h> 

void foo(void) 
{ 

int a = 41; 
a = a++; 

printf ("%d\n" , a); 

} 

int main(void) 
{ 

foo() ; 

} 




a gets an undefined value 



I don't get a warning when compiling it, 

and I do get 42 



Then you must increase the warning level, 
the value of a is certainly undefined after 
the assignment and increment because you 
violate one of the fundamental rules in C 
(and C++). The rules for sequencing says 
that you can only update a variable once 
between sequence points. Here you try to 
update it two times, and this causes a to 
become undefined. 




So you say a can be whatever? But I do get 42 



#include <stdio.h> 

void foo(void) 
{ 

int a = 41; 
a = a++; 
printf ("%d\n" , 

} 

int main(void) 
{ 

foo() ; 

} 



a) ; 




a gets an undefined value 



I don't get a warning when compiling it, 

and I do get 42 



Then you must increase the warning level, 
the value of a is certainly undefined after 
the assignment and increment because you 
violate one of the fundamental rules in C 
(and C++). The rules for sequencing says 
that you can only update a variable once 
between sequence points. Here you try to 
update it two times, and this causes a to 
become undefined. 



So you say a can be whatever? But I do get 42 





Indeed! a can be 42, 4 1 , 43, 0, 1 099, or whatever... I am not surprised that your machine gives 
you 42... what else can it be here? Or perhaps the compiler choose 42 whenever a value is 
undefined ;-) 



So what about this code snippet? 



So what about this code snippet? 

#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 



#include <stdio.h> 








int b(void) { puts("3") ; 
int c(void) { puts("4") ; 


return 
return 


3; 
4; 


} 
} 


int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 















\ 



#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 




#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 



Easy, it prints 3, 4 and then 7j 



^ Actually, this could also be 4, 3 and then 7 





#include <stdio.h> 




int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 




Easy, it prints 3, 4 and then 7j 



^ Actually, this could also be 4, 3 and then 7 



Huh? Is the evaluation order undefined? 



□ 




#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 



int 
{ 



mai n (voi d) 

int a = b() + c() ; 
printf ("%d\n" , a); 



} 





Easy, it prints 3, 4 and then 7j 



^ Actually, this could also be 4, 3 and then 7 



Huh? Is the evaluation order undefined? 



□ 




^ It is not really undefined, it is unspecified 




#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 




Easy, it prints 3, 4 and then 7j 



^ Actually, this could also be 4, 3 and then 7 



Huh? Is the evaluation order undefined? 



□ 






^ It is not really undefined, it is unspecified 
Well, whatever. Lousy compilers. I think it should give us a warning? J 



#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 




Easy, it prints 3, 4 and then 7j 



^ Actually, this could also be 4, 3 and then 7 



Huh? Is the evaluation order undefined? 



□ 





^ It is not really undefined, it is unspecified 
Well, whatever. Lousy compilers. I think it should give us a warning? J 




A warning about what? 




#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 



#include <stdio.h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main(void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 



#i nclude <stdi o . h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main (void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 

The evaluation order of most expressions in C and C++ are unspecified, the 
compiler can choose to evaluate them in the order that is most optimal for 
the target platform. This has to do with sequencing again. 

The code is conforming.This code will either print 3, 4, 7 or 4, 3, 7, depending 

on the compiler. 



#i nclude <stdi o . h> 



int b(void) { puts("3"); return 3; } 
int c(void) { puts("4"); return 4; } 

int main (void) 
{ 

int a = b() + c() ; 
printf ("%d\n" , a); 

} 

The evaluation order of most expressions in C and C++ are unspecified, the 
compiler can choose to evaluate them in the order that is most optimal for 
the target platform. This has to do with sequencing again. 

The code is conforming.This code will either print 3, 4, 7 or 4, 3, 7, depending 

on the compiler. 




At this point I think he has just revealed a shallow 
understanding of C programming, while she has 
excelled in her answers so far. 



So what is it that she seems to understand better than most? 



So what is it that she seems to understand better than most? 



• Declaration and Definition 



So what is it that she seems to understand better than most? 




Declaration and Definition 

Calling conventions and activation frames 



So what is it that she seems to understand better than most? 




Declaration and Definition 

Calling conventions and activation frames 

Sequence points 



So what is it that she seems to understand better than most? 




Declaration and Definition 

Calling conventions and activation frames 

Sequence points 

Memory model 



So what is it that she seems to understand better than most? 




• Declaration and Definition 

• Calling conventions and activation frames 

• Sequence points 

• Memory model 

• Optimization 



So what is it that she seems to understand better than most? 




Declaration and Definition 

Calling conventions and activation frames 

Sequence points 

Memory model 

Optimization 

Knowledge of different C standards 



We'd like to share some things about: 




Sequence points 
Different C standards 



What do these code snippets print? 



What do these code snippets print? 



1 




What do these code snippets print? 



1 


int 


a=41; 


a++; printf ("%d\n", a) ; 










2 


int 


a=41; 


a++ & printf ("%d\n" f a); 



What do these code snippets print? 



1 


int 


a=41; 


a++; 


printf ("%d\n", a) ; 












2 


int 


a=41; 


a++ 


& printf ("%d\n" f a) ; 












3 


int 


a=41; 


a++ 


ScSc printf ("%d\n" f a) ; 



What do these code snippets print? 



1 


int 


a=41; 


a++; printf ("%d\n" f a) ; 










2 


int 


a=41; 


a++ & printf ("%d\n" f a) ; 










3 


int 


a=41; 


a++ && printf ("%d\n" f a) ; 










4 


int 


a=41; 


if (a++ < 42) printf ("%d\n" f a); 



What do these code snippets print? 



1 


int 


a= 


41; 


a++; printf ("%d\n" f a) ; 












2 


int 


a= 


=41; 


a++ & printf ("%d\n" f a); 












3 


int 


a= 


=41; 


a++ 8c8c printf ( %d\n" , a); 












4 


int 


a= 


=41; 


if (a++ < 42) printf ("%d\n", a) ; 












5 


int 


a= 


=41; 


a = a++; printf ( "%d\n" , a); 



What do these code snippets print? 



1 


int 


a= 


41; 


a++; printf ("%d\n" f a) ; 












2 


int 


a= 


=41; 


a++ & printf ("%d\n" f a); 












3 


int 


a= 


=41; 


a++ 8c8c printf ( %d\n" , a); 












4 


int 


a= 


=41; 


if (a++ < 42) printf ("%d\n", a) ; 












5 


int 


a= 


=41; 


a = a++; printf ( "%d\n" , a); 



What do these code snippets print? 



1 

2 



3 




int a=41; a++ & printf ( "%d\n" , a); 



42 



undefined 




4 




5 



int a=41; a = a++; printf ( "%d\n" , a); 



What do these code snippets print? 



1 

2 



3 




int a=41; a++ & printf ( "%d\n" , a); 




42 



undefined 



42 



4 




5 



int a=41; a = a++; printf ( "%d\n" , a); 



What do these code snippets print? 



1 

2 



3 




int a=41; a++ & printf ( "%d\n" , a); 




42 



undefined 



42 



4 




42 



5 




What do these code snippets print? 



1 


int 


a= 


41; 


a++; printf ("%d\n" f a) ; 












2 


int 


a= 


=41; 


a++ & printf ("%d\n" f a); 












3 


int 


a= 


=41; 


a++ 8c8c printf ( %d\n" , a); 












4 


int 


a= 


=41; 


if (a++ < 42) printf ("%d\n", a) ; 












5 


int 


a= 


=41; 


a = a++; printf ( "%d\n" , a); 



What do these code snippets print? 



1 

2 



3 




int a=41; a++ & printf ( "%d\n" , a); 



4 



int a=41; if (a++ < 42) printf ( "%d\n" , a); 




42 



undefined 



42 



42 



5 



int a=41; a = a++; printf ( "%d\n" , a); 



undefined 



l/l//?en exactly do side-effects take place in C and C++? 



Sequence Points 

A sequence point is a point in the program's 
execution sequence where all previous side- 
effects shall have taken place and where all 
subsequent side-effects shall not have taken pi 
(5.1.2.3) 



Sequence Points - Rule I 



Between the previous and next sequence point an 
object shall have its stored value modified at most 
once by the evaluation of an expression. (6.5) 




Sequence Points - Rule 2 



Furthermore, the prior value shall be read only to 
determine the value to be stored. (6.5) 




Sequence Points 

A lot of developers think C has many sequence points 



Sequence Points 

The reality is that C has very few sequence points. 




This helps to maximize optimization opportunities 
for the compiler. 



/* K&R C */ 

void say_it(a, s) 
int a; 
char s [] ; 

{ 

printf ("%s %d\n", s, a) ; 

} 

main( ) 
{ 

int a = 42; 

puts ("Welcome to classic C"); 
say_it(a, "the answer is"); 

} 



/* C89 */ 

void say_it(int a, char * s) 
{ 

printf ("%s %d\n", s, a) ; 

} 

main( ) 
{ 

int a = 42; 

putsC'Welcome to C89") ; 
say_it(a, "the answer is"); 

} 



// C++ (C++98) 



#include <cstdio> 

struct X { 
int a; 

const char * s; 

explicit X(const char * s, int a = 42) 

: a(a), s(s) {} 
void say_it() const { 

std: :printf ("%s %d\n", s, a); 

} 

}; 

int mainO 
{ 

XC'the answer is") . say_it( ) ; 

} 



// C99 




struct X 




{ 




int a; 




char * s; 

}; 




int main(void) 




{ 




putsC'Welcome to C99") ; 




struct X x = { . s = "the 


answer is", .a = 42 }; 


printf("%s %d\n", x.s, x. 


a); 


} 





Let's get back to our two developers... 



So what about this code snippet? 



So what about this code snippet? 



#include <stdio.h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 

pri ntf ( "%d\n" , si zeof (char )) ; 

printf ("%d\n" , si zeof (struct X)); 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt ) ) ; 

pri ntf ( "%d\n" , si zeof (char )) ; 

printf ("%d\n" , si zeof (struct X)) 



#i nclude <stdi o . h> 

struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 
pri ntf ( "%d\n" , si zeof (char )) ; 
printf ("%d\n" , si zeof (struct X)); 





#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 

pri ntf ( "%d\n" , si zeof (char )) ; 

printf ("%d\n" , si zeof (struct X)); 



It will print 4, I and 12 



Q Indeed, it is exactly what I get on my machine 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 



pri ntf ( "%d\n" , si zeof (char )) ; 
printf ("%d\n" , si zeof (struct X)); 



} 




It will print 4, I and 12 J 

Q Indeed, it is exactly what I get on my machine 

Well of course, because sizeof returns the number of bytes. And in C 
int is 32 bits or 4 bytes, char is one byte and when the the size of structs 
^ are always rounded up to multiples of 4 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 



pri ntf ( "%d\n" , si zeof (char )) ; 
printf ("%d\n" , si zeof (struct X)); 



} 




It will print 4, I and 12 J 

Q Indeed, it is exactly what I get on my machine 

Well of course, because sizeof returns the number of bytes. And in C 
int is 32 bits or 4 bytes, char is one byte and when the the size of structs 
^ are always rounded up to multiples of 4 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 

pri ntf ( "%d\n" , si zeof (char )) ; 

printf ("%d\n" , si zeof (struct X)); 



It will print 4, I and 12 



Indeed, it is exactly what I get on my machine 




Well of course, because sizeof returns the number of bytes. And in C 
int is 32 bits or 4 bytes, char is one byte and when the the size of structs 

are always rounded up to multiples of 4 




do you want another ice cream? 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 

pri ntf ( "%d\n" , si zeof (char )) ; 

printf ("%d\n" , si zeof (struct X)); 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 

pri ntf ( "%d\n" , si zeof (char )) ; 

printf ("%d\n" , si zeof (struct X)); 



#include <stdio.h> 



struct X { int a; char b; int c; }; 

int main(void) 
{ 

printf ("%d\n" , sizeof (int)) ; 



pri ntf ( "%d\n" , si zeof (char) ) ; 
printf ("%d\n" , si zeof (struct X)); 



} 



Hmm... first of all, let's fix the code. The return type of sizeof is 
size_t which is not the same as int, so %d is a poor specifier to use in 

the format string for printf here 



#include <stdio.h> 



struct X { int a; char b; int c; }; 

int main(void) 
{ 

printf ("%d\n" , sizeof (int)) ; 



pri ntf ( "%d\n" , si zeof (char) ) ; 
printf ("%d\n" , si zeof (struct X)); 



} 



Hmm... first of all, let's fix the code. The return type of sizeof is 
size_t which is not the same as int, so %d is a poor specifier to use in 

the format string for printf here 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 



i nt 
{ 



mai n (voi d) 

printf ("%d\n M , 
printf ("%d\n" , 
printf ("%d\n" , 



si zeof (int)) ; 
si zeof (char ) ) ; 
si zeof (struct X)) 



} 



Hmm... first of all, let's fix the code. The return type of sizeof is 
size_t which is not the same as int, so %d is a poor specifier to use in 

the format string for printf here 



Q ok, what should specifier should we use? 




Thats a bit tricky. size_t is an unsigned integer type, but on say 32-bit 
machines it is usually an unsigned int and on 64-bit machines it is usually an 
unsigned long. In C99 however, they introduced a new specifier for printing 

size_t values, so %ZU might be an option. 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 



i nt 
{ 



mai n (voi d) 

printf ("%d\n M , 
printf ("%d\n" , 
printf ("%d\n" , 



si zeof (int)) ; 
si zeof (char ) ) ; 
si zeof (struct X)) 



} 



Hmm... first of all, let's fix the code. The return type of sizeof is 
size_t which is not the same as int, so %d is a poor specifier to use in 

the format string for printf here 



Q ok, what should specifier should we use? 




Thats a bit tricky. size_t is an unsigned integer type, but on say 32-bit 
machines it is usually an unsigned int and on 64-bit machines it is usually an 
unsigned long. In C99 however, they introduced a new specifier for printing 

size_t values, so %ZU might be an option. 



^ok, let's fix the p rint f issue, and then you can try to answer the question 



#i nclude <stdi o . h> 

struct X { int a; char b; int c; }; 

int main (void) 
{ 

printf ("%d\n" , si zeof ( i nt) ) ; 
pri ntf ( "%d\n" , si zeof (char )) ; 
printf ("%d\n" , si zeof (struct X)); 





#i nclude <stdi o . h> 

struct X { int a; char b; int c; }; 

int main (void) 
{ 

pr i ntf ( "%zu\n " , si zeof ( i nt ) ) ; 

pr i ntf ( "%zu\n " , si zeof (char )) ; 

printf ("%zu\n" , si zeof (struct X)); 





#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int main (void) 
{ 

pr i ntf ( "%zu\n " , si zeof ( i nt) ) ; 
pr i ntf ( "%zu\n " , si zeof (char )) ; 



pr i ntf ( "%zu\n " , si zeof (struct X)); 



} 




Now it all depends on the platform and the compile time options 
provided. The only thing we know for sure is that sizeof char is I . Do 

you assume a 64-bit machine? 
V 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int mai n (voi d) 
{ 

pr i ntf ( "%zu\n " , si zeof ( i nt ) ) ; 

pr i ntf ( "%zu\n " , si zeof (char )) ; 

pr i ntf ( "%zu\n " , si zeof (struct X)); 

} 




Now it all depends on the platform and the compile time options 
provided. The only thing we know for sure is that sizeof char is I . Do 

you assume a 64-bit machine? 



Q Yes, I have a 64-bit machine running in 32-bit compatibility mode. 




#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 



i nt 
{ 



mai n (voi d) 

printf ("%zu\n" , 
printf ("%zu\n" , 
printf ("%zu\n" , 



si zeof (int)) ; 

si zeof (char ) ) ; 

si zeof (struct X) ) ; 



} 




Now it all depends on the platform and the compile time options 
provided. The only thing we know for sure is that sizeof char is I . Do 

you assume a 64-bit machine? 

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 

Q Yes, I have a 64-bit machine running in 32-bit compatibility mode. 



Then I would like to guess that this prints 4, 1 , 1 2 due to word alignment ^ 




#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 



i nt 
{ 



mai n (voi d) 

printf ("%zu\n" , 
printf ("%zu\n" , 
printf ("%zu\n" , 



si zeof (int)) ; 

si zeof (char ) ) ; 

si zeof (struct X) ) ; 



} 




Now it all depends on the platform and the compile time options 
provided. The only thing we know for sure is that sizeof char is I . Do 

you assume a 64-bit machine? 

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 

Q Yes, I have a 64-bit machine running in 32-bit compatibility mode. 



Then I would like to guess that this prints 4, 1 , 1 2 due to word alignment ^ 





But that of course also depends also on compilation flags. It could be 4, 1 , 9 if 
you ask the compiler to pack the structs, eg -f pack-struct in gcc. 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int mai n (voi d) 
{ 

printf ("%zu\n" , sizeof (int)) ; 

pr i ntf ( "%zu\n " , si zeof (char ) ) ; 

pr i ntf ( "%zu\n " , si zeof (struct X)); 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 

int mai n (voi d) 
{ 

pr i ntf ( "%zu\n " , si zeof ( i nt ) ) ; 

pr i ntf ( "%zu\n " , si zeof (char )) ; 

pr i ntf ( "%zu\n " , si zeof (struct X)); 

} 



Q 4, I, 12 is indeed what I get on my machine. Why 12? 



#i nclude <stdi o . h> 



struct X { int a; char b; int c; }; 



i nt 
{ 



} 



mai n (voi d) 

printf ("%zu\n" , 
printf ("%zu\n" , 
printf ("%zu\n" , 



si zeof (int)) ; 
si zeof (char ) ) ; 
si zeof (struct X)) 



c 



4, I, 12 is indeed what I get on my machine. Why 12? 




It is very expensive to work on subword data types, so the compiler will optimize 
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4, I, 12 is indeed what I get on my machine. Why 12? 




It is very expensive to work on subword data types, so the compiler will optimize 
the code by making sure that C is on a word boundary by adding some padding.Also 
elements in an array of St ruct X will now align on word-boundaries. 

^ Why is it expensive to work on values that are not aligned? 




The instruction set of most processors are optimized for moving a word of data between 
memory and CPU. Suppose you want to change a value crossing a word boundary, you would 
need to read two words, mask out the value, change the value, mask and write back two words. 

Perhaps 10 times slower. Remember, C is focused on execution speed. 
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efficient size, what if you have an array of St ruct X objects? But if you add it 

just after char b, then 1 2 is a more plausible answer. 
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If you add it to the end of the struct, my guess is that the size of the struct 
becomes 1 6 on your machine. This is first of all because 1 3 would be a not so 
efficient size, what if you have an array of St ruct X objects? But if you add it 

just after char b, then 1 2 is a more plausible answer. 

So why doesn't the compiler reorder the members in the structure to 
optimize memory usage, and execution speed? 
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If you add it to the end of the struct, my guess is that the size of the struct 
becomes 1 6 on your machine. This is first of all because 1 3 would be a not so 
efficient size, what if you have an array of St ruct X objects? But if you add it 

just after char b, then 1 2 is a more plausible answer. 
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So why doesn't the compiler reorder the members in the structure to 

optimize memory usage, and execution speed? 




Some languages actually do that, but C and C++ don't. 
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You said your runtime was 64-bit, so a pointer is probably 8 bytes... Maybe the 
struct becomes 20? But perhaps the 64-bit pointer also needs alignment for 

efficiency? Maybe this code will print 4, 1 ,24? 
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struct X { int a; char b; int c; }; 

int mai n (voi d) 
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pr i ntf ( "%zu\n " , si zeof ( i nt ) ) ; 

pr i ntf ( "%zu\n " , si zeof (char )) ; 

pr i ntf ( "%zu\n " , si zeof (struct X)); 

} 



Q so what if I add achar * dto the end of the struct? 
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You said your runtime was 64-bit, so a pointer is probably 8 bytes... Maybe the 
struct becomes 20? But perhaps the 64-bit pointer also needs alignment for 

efficiency? Maybe this code will print 4, 1 ,24? 
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Nice answer! It does not matter what I actually get on my 
machine. I like your argument and your insight. 
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• Some experience with 32-bit vs 64-bit issues 

• Memory alignment 

• CPU and memory optimization 

• Spirit of C 



We'd like to share some things about: 




Memory model 
Optimization 
The spirit of C 



Memory Model 



static storage 

An object whose identifier is declared with external or internal 
linkage, or with the storage-class specifier static has static 
storage duration. It's lifetime is the entire execution of the 
program... (6.2.4) 

int * immortal (void) 
{ 

static int storage = 42; 
return Sstorage; 

} 



Memory Model 



automatic storage 

An object whose identifier is declared with no linkage and 
without the storage-class specifier static has automatic 
storage duration. ... It's lifetime extends from entry into the 
block with which it is associated until execution of that block 
ends in any way. (6.2.4) 



int 
{ 



zombie (void) 



auto int storage = 42; 



return &storage 



Memory Model 



allocated storage 

...storage allocated by calls to calloc, malloc, and realloc... 
The lifetime of an allocated object extends from the allocation 
to the dealloction. (7.20.3) 



int 


* finite (void) 


{ 






int * ptr = malloc ( sizeof *ptr) ; 




*ptr = 42; 




return ptr; 


} 





Optimization 



By default you should compile with optimization 
on . Forcing the compiler to work harder helps it 
find more potential problems. 



opt . c 



# include <stdio . h> 



int main (void) 



{ 



int a; 

printf ("%d\n", 



a) ; 



>cc -Wall opt.c 



L 



no warning! 



i 



opt . c 



# include <stdio . h> 

int main (void) 
{ 

int a; 

printf ("%d\n", 

} 



a) ; 



PR 



>cc -Wall -0 opt.c 

warning: y a' is uninitialized 



The Spirit of C 



There are many facets of the spirit of C, but the 
essence is a community sentiment of the underlying 
principles upon which the C language is based 

(C Rationale Introduction) 

Trust the programmer 

Keep the language small and simple 

Provide only one way to do an operation 

Make it fast, even if it is not guaranteed to be portable 

Maintain conceptual simplicity 

Don't prevent the programmer from doing what needs 
to be done 



Let's ask our developers about C++ 



On a scale from I to 10, how do you rate your understanding of C++? 
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So what about this code snippet? 
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int main (void) 
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standard to behave just like a struct in C. 
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int main (void) 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 

This struct is a POD (Plain Old Data) struct and it is guaranteed by the C++ 

standard to behave just like a struct in C. 



So on your machine? I guess this will still print 12. 



And by the way, it looks weird to specify func(void) instead of func() as void is 
the default in C++. This is also true when defining the main function. Of 
course, no kittens are hurt by this, it just looks like the code is written by a 

die-hard C programmer struggling to learn C++ 
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int main() 
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#include <iostream> 




struct X 
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i n t a ; 




char b; 




i nt 




void set_value ( i nt v) { a = v; 
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int main() 
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Eh, can you do that in C++? I think you must use a class. J 
^ What is the difference between a class and a struct in C++? 
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Eh, can you do that in C++? I think you must use a class. J 
What is the difference between a class and a struct in C++? 



Eh, in a class you can have member functions, while I don't think you can have member 
functions on structs. Or maybe you can? Is it the default visibility that is different? 
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Anyway, now this code will print 1 6. Because there 
will be a pointer to the function. 
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void i ncrease_value () { a++; } 
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the table! I do really have a deep understanding of this, I just forgot. 
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Ah, of course, it has a table of function pointers and only needs one pointer to 
the table! I do really have a deep understanding of this, I just forgot. 
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Actually, on my machine this code still prints 12. 




Huh? Probably some weird optimization going on, 
perhaps because the functions are never called. 
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On you machine? I guess 12 again? ^ ^ Ok, why? 




Because adding member functions like this does not change the size of the 
struct.The object does not know about it's functions, it is the functions that 

know about the object. 
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On you machine? I guess 12 again? ^ ^ Ok, why? 




J 



Because adding member functions like this does not change the size of the 
struct.The object does not know about it's functions, it is the functions that 

know about the object. 



If you rewrite this into C it becomes obvious. J 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

C++ 

void set_value ( i nt v) { a = 
int get_value() { return a; 
void i ncrease_value () { a++ 

}; 



C++ 
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struct X 
{ 

i n t a ; 
char b; 
i n t c ; 



void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 



struct X 
{ 

int a ; 
char b; 
int c ; 



void set_value (struct X * this, int v) { this->a = v; } 
int get_value (struct X * this) { return this->a; } 
void i ncrease_value (struct X * this) { this->a++; } 



C 



Like this? 



C++ 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 



}; 



void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 



C 



struct X 
{ 

int a ; 
char b; 
int c ; 

}; 

void set_value (struct X * this, int v) { this->a = v; } 
int get_value (struct X * this) { return this->a; } 
void i ncrease_value (struct X * this) { this->a++; } 



C 



Like this? 





f \ 
Yeah, just like that, and now it is obvious that functions like this do not change 

the size of the type and object. 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

^virtual void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 




#include <iostream> 



r 



struct X 
{ 



}; 



i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 



int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



C 



So what happens now? 




The size of the type will probably grow.The C++ standard does not say 
much about how virtual classes and overriding should be implemented, 
but a common approach is to create a virtual table and then you need a 
pointer to it. So in this case add 8 bytes? Does it print 20? 



#include <iostream> 




struct X 
{ 

i n t a ; 
char b; 
i n t c ; 



}; 



virtual void set_value ( i nt v) {a 
int get_value() { return a; } 
void i ncrease_value () { a++; } 



= v; } 



int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



C 



So what happens now? 




r 



The size of the type will probably grow.The C++ standard does not say 
much about how virtual classes and overriding should be implemented, 
but a common approach is to create a virtual table and then you need a 
pointer to it. So in this case add 8 bytes? Does it print 20? 



c 



I get 24 when I run this code snippet 



#include <iostream> 




struct X 
{ 



}; 



i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) {a 
int get_value() { return a; } 
void i ncrease_value () { a++; } 



= v; } 



int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



C 



So what happens now? 




r 



The size of the type will probably grow.The C++ standard does not say 
much about how virtual classes and overriding should be implemented, 
but a common approach is to create a virtual table and then you need a 
pointer to it. So in this case add 8 bytes? Does it print 20? 



^ I get 24 when I run this code snippet 



Ah, don't worry. It is probably just some extra padding to align the pointer J 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) { a = v; } 
>^ int get_value() { return a; } 
void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 




virtual void set_value ( i nt v) { a = v; } 
int get_value() { return a; } 
void i ncrease_value () { a++; } 



}; 



int main() 
{ 

std::cout << sizeof(X) << std : : endl ; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 




virtual void set_value ( i nt v) { a = v; } 
virtual int get_value() { return a; } 
virtual void i ncrease_value () { a++; } 



}; 



int main() 
{ 

std::cout << sizeof(X) << std::endl; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) { a = v; } 
virtual int get_value() { return a; } 
virtual void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std::endl; 

} 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) { a = v; } 
virtual int get_value() { return a; } 
virtual void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std::endl; 

} 




#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 



virtual void set_value ( i nt v) { a = v; } 
virtual int get_value() { return a; } 
virtual void i ncrease_value () { a++; } 



int main() 
{ 

std::cout << sizeof(X) << std::endl; 

} 



^ So what happens now? 




My guess is that it still prints 24, as you only need one vtable per class. 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 

virtual void set_value ( i nt v) { a = v; } 
virtual int get_value() { return a; } 
virtual void i ncrease_value () { a++; } 

}; 

int main() 
{ 

std::cout << sizeof(X) << std::endl; 

} 



So what happens now? 



My guess is that it still prints 24, as you only need one vtable per class. 



#include <iostream> 



struct X 
{ 

i n t a ; 
char b; 
i n t c ; 



virtual void set_value ( i nt v) { a = v; } 
virtual int get_value() { return a; } 
virtual void i ncrease_value () { a++; } 



int main() 
{ 

std::cout << sizeof(X) << std::endl; 

} 



^ So what happens now? 





My guess is that it still prints 24, as you only need one vtable per class. 



c 



ok, what is a vtable? 





It is a common implementation technique to support one type of 
polymorphism in C++. It is basically a jump table for function calls, and with it 
you can override functions when doing class inheritance 



let's consider another code snippet... 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



#include "B.hpp" 



class A { 

public: 

A(int sz) { sz_ = sz; v 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz 

}; 



= new B [sz_] ; } 



- i 




/ \ 
Take a look at this piece of code. Pretend like I am a junior C++ programmer 

joining your team. Here is a piece of code that I might present to you. Please 

be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 

teach me something about the C++ way of doing things. 



This is a piece of shitty C++ code. Is this your code? First of all.... 



#include "B.hpp" 




class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

r \ 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



This is a piece of shitty C++ code. Is this your code? First of all.... 



never use 2 spaces for indentation. J 



#include "B.hpp 1 




class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

r \ 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



This is a piece of shitty C++ code. Is this your code? First of all.... 




never use 2 spaces for indentation. J 



The curly brace after class A should definitely start on a new line 



#include "B.hpp 1 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

r \ 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



This is a piece of shitty C++ code. Is this your code? First of all.... 





never use 2 spaces for indentation. J 



The curly brace after class A should definitely start on a new line 



sz_? I have never seen that naming convention, you should always use the GoF 

standard sz or the Microsoft standard m sz. 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



i 

i 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



r 



Do you see anything else? 



#include "B.hpp" 



class A { 

public: 

A(int sz) { sz_ = sz; v 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 



= new B [sz_] ; } 



r 



eh? 



Do you see anything else? 




\ 

i 



#include "B.hpp" 




class A { 




public: 




A(int sz) { sz_ = sz; v = new B[sz_] 


; } 


~A( ) { delete v; } 




II ... 




private: 




// . . . 




B * v; 




int sz_; 

ft 





Do you see anything else? 
V 




Are you thinking about using 4 delete[]' instead of delete' when 
deleting an array of objects? Well, I am experienced enough to know 

that it is not really needed, modern compilers will handle that. 
J 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



Do you see anything else? 



/ \ 

Are you thinking about using 4 delete[]' instead of delete' when 
deleting an array of objects? Well, I am experienced enough to know 
that it is not really needed, modern compilers will handle that. 




Ok? What about the "rule of three"? Do you need 
to support or disable copying of this object? 



#include "B.hpp 1 



class A { 

public: 

A(int sz) { sz_ = sz; v 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 



= new B [sz_] ; } 



Do you see anything else? 



/ \ 

Are you thinking about using 4 delete[]' instead of delete' when 
deleting an array of objects? Well, I am experienced enough to know 
that it is not really needed, modern compilers will handle that. 





Ok? What about the "rule of three"? Do you need 
to support or disable copying of this object? 




Yeah, whatever... never heard of the tree-rule but of course if people copy this 
object they might get problems. But I guess that is the spirit of C++... give 

programmers a really hard time. 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



i 

i 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 




/ \ 

And by the way, I guess you know that in C++ all destructors should 
always be declared as virtual. I read it in some book and it is very 
important to avoid slicing when deleting objects of subtypes. 




#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



/ \ 

And by the way, I guess you know that in C++ all destructors should 
always be declared as virtual. I read it in some book and it is very 
important to avoid slicing when deleting objects of subtypes. 



or something like that... 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



/ \ 

And by the way, I guess you know that in C++ all destructors should 
always be declared as virtual. I read it in some book and it is very 
important to avoid slicing when deleting objects of subtypes. 




another ice cream perhaps? 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



Oh, where should I start... let's focus on the most important stuff first 



#include "B.hpp 1 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

r \ 

Take a look at this piece of code. Pretend like I am a junior C++ programmer 
joining your team. Here is a piece of code that I might present to you. Please 
be pedantic and try to gently introduce me to pitfalls of C++ and perhaps 
teach me something about the C++ way of doing things. 



Oh, where should I start... let's focus on the most important stuff first 




/ \ 

In the destructor. If you use operator new[] you should destroy with operator 
del ete[]. With operator delete[] the allocated memory will be deallocated after the 
destructor for every object in the array will be called. Eg, as it stands now, the B 
constructor will be called sz times, but the B destructor will only be called once. In 
this case, bad things will happen if B allocates resources that need to be released in 

its destructor. 



#include "B.hpp 1 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

And the next thing is the often referred to as the "rule of three". If you need a 
destructor, you probably also need to either implement or disable the copy 
constructor and the assignment operator, the default ones created by the 

compiler are probably not correct. 



#include "B.hpp 1 




class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

And the next thing is the often referred to as the "rule of three". If you need a 
destructor, you probably also need to either implement or disable the copy 
constructor and the assignment operator, the default ones created by the 

compiler are probably not correct. 



r 




A perhaps smaller issue, but also important, is to use the member initializer list to 
initialize an object. In the example above it does not really matter much, but when 
member objects are more complex it makes sense to explicitly initialize the 
members (using the initalizer list), rather than letting the object implicitly initialize all 
its member objects to default values, and then assign them some particular value. 



#include "B.hpp 1 




class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A( ) { delete v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 

And the next thing is the often referred to as the "rule of three". If you need a 
destructor, you probably also need to either implement or disable the copy 
constructor and the assignment operator, the default ones created by the 

compiler are probably not correct. 



r 




A perhaps smaller issue, but also important, is to use the member initializer list to 
initialize an object. In the example above it does not really matter much, but when 
member objects are more complex it makes sense to explicitly initialize the 
members (using the initalizer list), rather than letting the object implicitly initialize all 
its member objects to default values, and then assign them some particular value. 



Please fix the code and I will tell you more... 



#include "B.hpp" 



class A { 

public: 

A(int sz) { sz_ = 
~A( ) { delete v; 

// ... 

private: 

// ... 

B * v; 
int sz_; 

}; 



v = new B [sz_] ; } 



#include "B.hpp" 

class A { 

public: 

A(int sz) { sz_ = 
-AO { delete [] v; 

// ... ^ 

private: 

// ... 

B * v; 
int sz_; 

}; 



= new B [sz_] ; } 



#include "B.hpp" 

class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
-AO { delete [] v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



#include "B.hpp" 

class A { 

public: 

A(int sz) { sz_ = sz; v 
-AO { delete [] v; } 

// ... 

private: 

// ... 

B * v; 
int sz_; 



= new B [sz_] ; } 



#include "B.hpp" 



class A { 

public: 

A(int sz) { sz_ = sz; v 
~A() { delete [] v; } 

// ... 

private: 



// ... 

B * v; 
int sz_; 

}; 



= new B [sz_] ; } 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A() { delete [] v; } 

// ... 

private: 

^A(const A &) ; 

^A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
~A() { delete [] v; } 

// ... 

private: 

A(const A &) ; 

A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 



}; 




#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
^ ~A() { delete [] v; } 

// ... 

private: 

A(const A &) ; 

A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
^virtual ~A() { delete [] v; } 

// ... 

private: 

A(const A &) ; 

A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 

}; 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
virtual ~A() { deletet] v; } 

// ... 

private: 
A(const A &) ; 
A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 

}; 

nah, nah, nah... hold your horses 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
virtual ~A() { deletet] v; } 

// ... 

private: 
A(const A &) ; 
A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 

}; 




nah, nah, nah... hold your horses 



What is the point of having a virtual destructor on a class like this? 
There are no virtual functions so it does not make sense to inherit 
from it. I know that there are programmers who do inherit from 
non-virtual classes, but I suspect they have misunderstood a key 
concept of object orientation. I suggest you remove the virtual 
specifier from the destructor, it indicates that the class is designed 
to be used as a base class - while it obviously is not. 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
virtual ~A() { deletet] v; } 

// ... 

private: 
A(const A &) ; 
A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 

}; 




nah, nah, nah... hold your horses 



What is the point of having a virtual destructor on a class like this? 
There are no virtual functions so it does not make sense to inherit 
from it. I know that there are programmers who do inherit from 
non-virtual classes, but I suspect they have misunderstood a key 
concept of object orientation. I suggest you remove the virtual 
specifier from the destructor, it indicates that the class is designed 
to be used as a base class - while it obviously is not. 



why don't you fix the initializer list issue instead 
V / 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
^virtual ~A() { delete [] v; } 

// ... 

private: 

A(const A &) ; 

A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 

}; 



#include "B.hpp" 



class A { 
public: 

A(int sz) { sz_ = sz; v = new B[sz_]; } 
^ ~A() { delete [] v; } 

// ... 

private: 

A(const A &) ; 

A & operator=(const A &) ; 

// ... 

B * v; 
int sz_; 



#include "B.hpp" 



class A { . 
public: V 

A(int sz) { sz_ = sz; v = new B[sz_]; } 

~A() { delete [] v; } 

// ... 

private: 

A(const A &) ; 

A & operator=(const A &) ; 

// ... 

B * v; 
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int sz_; 

}; 

ouch... but do you see the problem we just introduced? 
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Without warning flags you might not notice the mistake here. But if you 
increase the warning levelsl it will scream the problem in your face... 
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int sz_; 
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ouch... but do you see the problem we just introduced? 



Are you compiling with -Wall? You should consider -Wextra 

pedantic and -Weffc++ as wel 



Without warning flags you might not notice the mistake here. But if you 
increase the warning levelsl it will scream the problem in your face... 



A nice rule of thumb is to always write the member initializers in the order they are defined. In 
this case, when v(new B[sz_J) is evaluated sz_ is undefined, and then sz_ is initialized with sz. 

Actually, these things are just too common in C++ code. 
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Now this looks better! Is there anything else that needs to be 
improved? Perhaps some small stuff that I would like to mention... 
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When I see bald pointers in C++ it is usually a bad sign. A lot of good C++ 
programmers tend to avoid using them like this. In this case of course, it 
looks like v is a candidate for being an STL vector or something like that. 
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// ... 
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Now this looks better! Is there anything else that needs to be 
improved? Perhaps some small stuff that I would like to mention... 



When I see bald pointers in C++ it is usually a bad sign. A lot of good C++ 
programmers tend to avoid using them like this. In this case of course, it 
looks like v is a candidate for being an STL vector or something like that. 



You seem to use different naming conventions for private member variables, but as long as it is 

private stuff I think you can do whatever you want. But I guess either postfixing all member 
variables with _ is fine, so is prefixing with m_, but you should never just prefix with _ because 
you might stumble into reserved naming conventions for C, Posix and/or compilers. 
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So what is it that she seems to understand better than most? 




• the connection between C and C++ 

• some techniques for polymorphism 

• how to initialize objects properly 

• rule of three 

• operator new[] and delete [] 

• common naming conventions 



We'd like to share some things about: 




Object lifetime 
The rule of three 
The vtable 



proper object initialization 

assignment is not the same as initialization 



bad style 
A() 

A (int) 

~A() 

~A() 

good style 
A (int) 
~A() 



struct A 
{ 

A() { puts ("A() ") | } 

A(int v) { puts ("A(int) ") ; } 
-A() { puts ("4() ") | } 

In- 
struct X 
{ 

X(int v) { a=v; } 
X (long v) : a (v) { } 
A a; 

}; 

int main ( ) 
{ 

puts ("bad style"); 

{ X slow (int (21 1 1 } 
puts ("good style"); 

{ X fast (long (2) ) $ } 

} 



object lifetime 



A basic principle of C++ is that the operations performed when 
an object's life ends are the exact reverse of the operations 
performed when the object's life starts. 



A() { puts ("A() ■ | j } 
-AO { puts ("-AO "| J } 

In- 



struct B 



}; 



B() { puts ("BO ") ; } 
B() { puts ("~B() ") ; } 




struct C 



A a; 
B b; 



}; 



object lifetime 



A basic principle of C++ is that the operations performed when 
an object's life ends are the exact reverse of the operations 
performed when the object's life starts. 



struct A 
{ 

A() : id(count++) 



}; 



{ 



printf ( M A(%d) \n", id) ; 



-A() 



printf ( M ~A(%d) \n", id) ; 

} 

int id; 

static int count; 



int main ( ) 
{ 

A array [ 4 ] ; 

} 



A(0) 

A(l) 

A(2) 

A(3) 

~A(3) 

~A(2) 

~A(1) 

~A(0) 



object lifetime 



A basic principle of C++ is that the operations performed when 
an object's life ends are the exact reverse of the operations 
performed when the object's life starts. 



struct A 
{ 

A() : id(count + +) 

{ 

printf ("A (%d) \n", id) ; 

} 

~A0 

{ 

printf ("~A(%d) \n", id) ; 

} 

int id; 

static int count; 

}; 



int main ( ) 
{ 

A * array = new A [4]; 
delete [ ] array; 

} 



int main ( ) 
{ 

A * array = new A [4]; 
delete array; 

} 



A(0) 

A(l) 

A(2) 

A(3) 

~A(3) 

~A(2) 

~A(1) 

~A(0) 



A(0) 
A(l) 
A(2) 
A(3) 
~A(0) 



The Rule ofThree 

class defines a copy constructor 




public : 



wibble_ptr ( ) 

: ptr (new wibble) , count (new int(l)) { 



} 



wibble_ptr (const wibble_ptr & other) 

: ptr (other .ptr) , count (other . count ) { 
( * count ) ++; 



} 



private : 



wibble * ptr; 
int * count; 




The Rule of Three 



If a class defines a copy constructor, a copy assignment 
operator 



class wibble_ptr { 
public : 

wibble_ptr ( ) 

: ptr (new wibble) , count (new int(l)) 

} 




wibble_ptr & operator= (const wibble_ptr & rhs) 
wibble_ptr copy (rhs); 
swap (copy) ; 
return *this; 

} 



private : 

wibble * ptr; 
int * count; 



The Rule of Three 



If a class defines a copy constructor, a copy assignment 
operator, or a destructor 




public : 



wibble_ptr ( ) 

: ptr (new wibble) , count (new int(l)) { 



} 



~wibble_ptr () { 



if ( — (*count) == 0) 
delete ptr; 



} 



private : 



wibble * ptr; 
int * count; 



}; 




The Rule of Three 



If a class defines a copy constructor, a copy assignment 
operator, or a destructor, then it should define all three. 



class wibble_ptr { 
public : 

wibble_ptr ( ) 

: ptr (new wibble) , count (new int(l)) { 




wibble_ptr (const wibble_ptr & other) 

: ptr (other .ptr) , count (other . count ) { 
( * count ) ++; 

} 

wibble_ptr & operator= ( const wibble_ptr & rhs) { 
wibble_ptr copy (rhs); 
swap (copy) ; 
return *this; 

} 

~wibble_ptr () { 

if ( — (*count) == 0) 



delete ptr; 

} 

• • • 

private : 

wibble * ptr; 
int * count; 



The vtable 



struct base 
{ 

virtual void f(); 
virtual void g(); 
int a,b; 

In- 
struct derived : base 
{ 

virtual void g(); 
virtual void h(); 
int c; 

}; 

void poly (base * ptr) 
{ 



ptr->f () | 
ptr->g ( ) ; 



} 



int main ( ) 

{ 

poly ( &base ( ) ) ; 
poly ( &derived ( ) ) ; 

} 
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base::f() {} 
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The vtable 



struct base 
{ 

void f||j 
virtual void g(); 
int a,b; 

} I 

struct derived : base 
{ 

virtual void g(); 
virtual void h(); 
int c; 

} I 

void poly (base * ptr) 
{ 

ptr->f () ; 
ptr->g () i 

I 

int main ( ) 

{ 

poly ( &base ( ) ) ; 
poly ( &derived ( ) ) ; 

} 



base::f() {} 




0 




base::g() {} 



base 
vtable 



base object 



vptr 




derived 
vtable 



derived::g() {} 



derived ::h() {} 



derived object 



Would it be useful if more of your colleagues have a deep 
understanding of the programming language they are using? 




We are not suggesting that all your C and C++ programmers 
in your organization need a deep understanding of the 
language. But you certainly need a critical mass of 
programmers that care about their profession and constantly 
keep updating themselves and always strive for a better 
understanding of their programming language. 



Let's get back to our two developers... 
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It is their attitude to learning! 
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am learning by doing. What more do you need? 
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What do you mean? I learned programming at university and now I 
am learning by doing. What more do you need? 



r 



So what kinds of books are you reading? 




Books? I don't need books. I look up stuff on internet when I need it. 
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Do you discuss programming with your colleagues? 



They are all stupid, I have nothing to learn from them... 
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You seem to know a lot about C and C++? How come? 



I am learning new things every day, I really enjoy it. 
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came out with a great book about Test-Driven Development in C? 
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I read books. Lots of books. Did you know that James Grenning just 
came out with a great book about Test-Driven Development in C? 



I have to admit that I visit WG 1 4 and WG2 1 once in a while 



I am a member of ACCU, for those who care about professionalism in 
programming, I read Overload, C Vu and discussions on accu-general 



And whenever I get a chance I attend classes teaching C and C++. It not 
always because I learn so much from the slides and the teacher, it is often 
through discussions with other learners that I expand my knowledge. 
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I am learning new things every day, I really enjoy it. 
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I occasionally follow C and C++ discussions on stack overflow, 

comp.lang.c and comp.lang.c++ 
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I am a member of a local C and C++ Users Group, we have 

meetings once in a while 




I read books. Lots of books. Did you know that James Grenning just 
came out with a great book about Test-Driven Development in C? 



I have to admit that I visit WG 1 4 and WG2 1 once in a while 



I am a member of ACCU, for those who care about professionalism in 
programming, I read Overload, C Vu and discussions on accu-general 



And whenever I get a chance I attend classes teaching C and C++. It not 
always because I learn so much from the slides and the teacher, it is often 
through discussions with other learners that I expand my knowledge. 



But perhaps the best source of knowledge is working closely with my colleagues and 
try to learn from them while contributing with my knowledge. 
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• compiler and linker 

• declaration vs definition 

• activation frame 

• memory segments 

• memory alignment 

• sequence points 

• evaluation order 

• undefined vs unspecified 

• optimization 

• something about C++ 

• proper initialization of objects 

• object lifetimes 

• vtables 

• rule of 3 

• ... and something about attitude and professionalism 




Eh? 
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Yes? 





I really love programming, but I realize now that perhaps I am not 
behaving as a true professional. Any advice on how to get started to 

get a deep knowledge of C and C++? 





I really love programming, but I realize now that perhaps I am not 
behaving as a true professional. Any advice on how to get started to 

get a deep knowledge of C and C++? 



First of all you must realize that programming is a continuous learning 
process, it does not matter how much you know, there is always much 
more to learn. The next thing to realize is that professional 
programming is first of all a team activity, you must work and develop 
together with your colleagues.Think about programming as a team 
sport, where nobody can win a whole match alone. 
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I really love programming, but I realize now that perhaps I am not 
behaving as a true professional. Any advice on how to get started to 

get a deep knowledge of C and C++? 
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First of all you must realize that programming is a continuous learning 
process, it does not matter how much you know, there is always much 
more to learn. The next thing to realize is that professional 
programming is first of all a team activity, you must work and develop 
together with your colleagues.Think about programming as a team 
sport, where nobody can win a whole match alone. 




Ok, I need to think about that... 



Having said that. Make it a habit to once in a while take a look at the assembly 
output actually produced by snippets of C and C++. There are a lot of surprising 
things to discover. Use a debugger, step through code, study how memory is used and 
^ook at the instructions actually executed by the processor. 



Any books, sites, courses and conferences about C and C++ you 

would like to recommend? 



Any books, sites, courses and conferences about C and C++ you 

would like to recommend? 
/ 

To learn modern ways of developing software, I recommend "Test-Driven 
Development for Embedded C" by James Grenning. For deep C 
knowledge, Peter van der Linden wrote a book called "Expert C 
programming" two decades ago, but the content is still quite relevant. For C 
++ I recommend you start with "Effective C++" by Scott Meyers and "C++ 
coding standards" by Herb Sutter and Andrei Alexandrescu. 
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And finally, I would recommend that you get yourself involved in user groups and 

communities for programmers. In particular I would recommend the ACCU 

(accu.org), they are very much focused on C and C++ programming. Did you 

know they host a conference in Oxford every spring where professional 

programmers from all around the world meet to discuss programming for a 

week? Perhaps I see you there in April next year? 
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